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A DNA-binding peptide comprising three zinc-fingers has been engineered to bind specifically 
to a unique nine-base-pair region of a BCR-ABL fusion oncogene in preference to the parent 
genomic sequences. Binding to the target oncogene in chromosomal DNA is possible in trans- 
formed cells in culture, and results in blockage of transcription. Consequently, murine cells 
rendered independent of growth factors by the action of the oncogene revert to factor depend- 
ence upon transient transfection with a vector expressing the peptide. 



The incorporation of proteins designed to recognize and bind 
specific DNA sequences into chimaeric transcription factors, 
recombinases and nucleases, for example, should have a wide 
range of applications. Wc have shown that zinc-finger mini- 
domains can discriminate between closely related DNA triplets, 
and have proposed thai they can be linked together to form 
domains for the specific recognition of longer DNA sequences 1 ' 2 . 
One interesting possibility for the use of such protein domains 
is to target selectively genetic differences in pathogens or trans- 
formed cells. Here we report the first such application, in which 
we have built a protein which recognizes a specific DNA 
sequence both in vitro and in vivo. 

There is a set of human leukaemias in which a reciprocal 
chromosomal translocation t(9; 22) (q34;ql 1) results in a trun- 
cated chromosome 22, the Philadelphia chromosome (Ph 1 ) 3 , 
encoding at the breakpoint a fusion of sequences from the c-ABL 
proto-oncogene 4 and the BCR gene 5 . In chronic myelogenous 
leukaemia (CML), the breakpoints usually occur in the first 
intron of the c-ABL gene and in the breakpoint cluster region 
of the BCR gene 6 , and give rise to a gene product, pl\Q BCR ~ ABL % 
of relative molecular mass {M t ) 210,000 (ref. 7). Alternatively, in 
acute lymphoblastic leukaemia (ALL), the breakpoints usually 
occur in the first introns of both BCR and c-ABL*, and result 
in a p\90 BCR ABL gene product (Fig. I) 9 . Facsimiles of these 
rearranged genes act as dominant transforming oncogenes in cell 
culture' and transgenic mice". Like their genomic counterparts, 
the complementary DNAs bear a unique nucleotide sequence at 
the fusion point of the BCR and c-ABL genes, which can be 
recognized at the DNA level by a site-specific DNA-binding 
protein. We have designed such a protein to recognize the unique 
fusion site in the p\9Q OCR ABL cDNA. Our experiments are 
repeated on p2\Q BCk ABL cDNA which serves as a control. The 
exon fusions found in these cDNAs arc obviously distinct from 
the breakpoints in the spontaneous genomic translocations, 
which occur in introns and arc thought to be variable among 
patients. Hence although the design of peptides able to bind 
oncogenes has implications for cancer research, our primary aim 
here is to prove the principle of protein design, and to assess the 



FIG. 1 Nucleotide sequences of the fusion point between BCR and ABL 
sequences in p!90 cDNA. and of the corresponding exon boundaries 
in the BCR and c-ABL genes. Exon sequences are written in capital 
letters and introns in lower-case letters. Top, pl90 BCR ' AOt cONA; middle, 
BCR genomic sequence at junction of exon 1 and intron 1: bottom, 
ABL, genomic sequence at junction of intron 1 and exon 2 (ref. 8). The 
9-bp target sequence in the pl90 nc ""* ut cDNA is underlined, as are 
the homologous sequences in genomic BCR and c-ASL. 



feasibility of in vivo binding to chromosomal DNA in available 
model systems. 

The DNA-binding proteins we create arc composed of class- 
ical zinc-finger motifs : ' u . These small motifs are ideal natural 
building blocks for de novo protein design because they function 
as independent modules 15 , but can be connected by a we!! known 
linker to allow recognition of long, asymmetric DNA 
sequences. Lately it has been possible to isolate zinc-fingers that 
bind to given DNA triplets by selection from phage display 
libraries of randomized zinc-fingers M7,K . The specificity of 
selected fingers is checked by a second selection technique called 
the 'binding-site signature*, in which these fingers are used to 
screen libraries of randomized oligonucleolide-binding sites, thus 
identifying fingers that can specify a unique base triplet 2 . From 
these and other studies' 9 , elements of a recognition code have 
emerged which relate the amino-acid sequence of zinc-fingers to 
their cognate triplet. 

The strategy we use in creating DNA-binding proteins com- 
bines phage display selection and rational design based on the 
available recognition rules. A 9-base-pair (bp) target sequence 
(GCA, GAA, GCC) for a three-zinc-finger peptide was chosen 
which spanned the fusion point of the p!90 sc * ABL cDNA 8 . The 
three triplets forming this binding site were each used to screen 
a zinc-finger phage library over three rounds as described 1 . The 
selected fingers were then analysed by binding-site signatures to 
reveal their preferred triplet, and mutations to improve speci- 
ficity were made to the finger selected for binding to GCA (Fig. 
2). A phage display mini-library of putative BCR- A ^-binding 
three-finger proteins was cloned in fd phage, comprising six poss- 
ible combinations of the six selected or designed fingers ( 1 A, IB; 
2A; 3A, 3B and 3C) linked in the appropriate order. The mini- 
library was screened once with an oligonucleotide containing the 
9 bp BCR-ABL target sequence, to select for light binding clones 
over weak binders and background vector phage. Because the 
library was small, we did not include competitor DNA sequences 
for homologous regions of the genomic BCR and c-ABL genes, 
but instead checked the selected clones for their ability to dis- 
criminate. Wc found that although all the selected clones were 



BCff • ABL TTC CAT GCA CAC CCA C aa GCC CTT CAG CGG CCA 

OCR TTC CAT GGA GAC GCA C gi 939 tic etc acg ccj 

ABL ccc cit tct ctt ccj g AA GCC CTT CAG CGG CCA 
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FIG. 2* Amino-acid sequences of zinc-fingers used in constructing the 
mini-library of putative BCR-ABL binders. Regions of secondary struc- 
ture are underlined below the list; residue positions are given above, 
relative to the first position of the a-helix (position 1). Zinc-finger phages 
were selected from a library of 2.6 x io 6 variants using three DNA- 
binding sites, each containing one of the triplets GCC. GAA or GCA 1 . 
Binding-site signatures (data not shown) indicate that fingers 1A and 
IB specify the triplet GCC. finger 2A specifies GAA. and the fingers 
selected using the triplet GCA all prefer binding to GCT (ref. 2). These 
include finger 3A, the specificity of which we believed, on the basts of 
recognition rules, could be changed by a point mutation. Finger 3B. 
based on the selected finger 3A. but in which Gin at helical position 
+2 was altered to Ala. should be specific for GCA. Finger 3C is an 
alternative version of finger 3A, in which the recognition of C is mediated 
by Asp + 3 rather than by Thr + 3. 
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FIG. 3 Discrimination in the binding of the anti-BCR-ABL peptide to its 
pl90 BCR ~ AaL target site and to like regions of genomic BCR and c-ABL. 
The graph shows binding (measured as an absorbance between 450 
and 650 nm) at various DNA concentrations. Binding reactions and 
complex detection by enzyme immunoassay were performed as 
described 2 , and a full curve analysis was used in calculations of the K„ 
(ref. 16). The DNAs used were oligonucleotides spanning 9 bp either 
side of the fusion point in the cDNA or the exon boundaries. The anti- 
BCR-ABL peptide binds to its intended target site with K d = 
6.2 ±0.4 x io -7 M, and is able to discriminate against genomic BCR 
and c-ABL sequences, although the latter differs by only one base pair 
in the bound 9-bp region. 
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able to bind the BCR-ABL target sequence and to discriminate 
between this and the genomic BCR sequence, only a subset could 
discriminate against the c-ABL sequence which, at the junction 
between intron 1 and exon 2, has an 8/9 bp homology to the 
BCR-ABL target sequence 8 . Sequencing of the discriminating 
clones revealed two types of selected peptide, one with the com- 
position t A-2A-3B and the other 1 B-2A-3B. Thus both peptides 
carried the third finger (3B) which was specifically designed 
against the triplet GCA, but peptide 1 A-2A-3B was able to bind 
to the BCR-ABL target sequence with higher affinity than was 
peptide 1 B-2A-3B. 

The peptide 1A-2A-3B, which we will refer lo as the anti- 
BCR-ABL peptide, was used in further experiments. The anti- 
BCR-ABL peptide has an apparent equilibrium dissociation 
constant (K d ) of 6.2±0.4x 10" 7 M for the p\90 nc * AltL cDNA 
sequence in vitro, and discriminates against the similar sequences 
found in genomic BCR and c-ABL DNA by factors greater 
than an order of magnitude (Fig. 3). The measured dissociation 
constant is higher than that of thrcc-fingcr peptides from natu- 
rally occurring proteins such as Spl (ref. 20) or Zif268 (ref. 
21 ), which have K tl values in the range of 10~ y M, but rather is 
comparable to that of I he two fingers from the tram track (ttk) 
prolcin 3 \ However, the affinity of the anti-BCR -ABL peptide 



could be refined if necessary by site-directed mutations or by 
'affinity maturation' of a phage display library 23 . 

Having established DNA discrimination in vitro, we wished 
to test whether the anti-BCR-ABL peptide was capable of site- 
specific DNA binding in vivo. The peptide was fused to the 
VP 1 6 activation domain from herpes simplex virus 24 and used 
in transient transfection assays (Fig. 4) to drive production of 
a chloramphenicol acetyl transferase (CAT) reporter gene from 
a binding site upstream of the TATA box 25 . A thirtyfold increase 
in CAT activity was observed in cells cotransfected with reporter 
plasmid bearing copies of the pl90 sc *""* cDNA target site, 
compared to a barely detectable increase in cells cotransfected 
with reporter plasmid bearing copies of either the BCR or c- 
ABL semihomologous sequences. The selective stimulation of 
transcription indicates convincingly that highly site-specific 
DNA binding can occur in vivo. But although transient transfec- 
tions assay binding lo plasmid DNA, the true target site for this 
and most other DNA-binding proteins is in genomic DNA. This 
might well present significant problems, not least because this 
DNA is physically separated from the cytosol by the nuclear 
membrane, but also because it may be packaged within 
chromatin. 

To study whether genomic targeting is possible, wc made a 
construct in which the anti-BCR-ABL peptide was flanked at 
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FIG. 4 Transaciivation reporter assays Lfsing the anti-BCR-ABL peptide 
fused to the VP16 activation domain. C3H10T1/2 cells were transiently 
cotransfected with a CAT reporter plasmid and an anti-8CR-ABl/VPl6 
expression vector (pZNIA). The top panel shows the results of thin-layer 
chromatography of samples from different transfections. in which the 
fold induction of CAT activity relative to a sample in which reporter 
alonewas transfected (lane 1) is plotted on a histogram below. A specific 
(30-fold) increase in CAT activity was observed in cells cotransfected 
with reporter plasmid bearing copies of the pl90 BCR "* at cDNA target 
site, indicating binding in vivo. The particular constructs used in different 
transfections are noted below the histogram. 

METHODS. Reporter plasmids pMCAT6BA, pMCAT6A and pMCAT6B. 
were constructed by inserting 6 copies of the pl90 eCR *"* BL target site 
(CGCAGAAGCC), the c-ABL second exon-intron junction sequence 
(TCCAGAAGCC). or the BCR first exon-intron junction sequence 
(CGCAGGTGAG). respectively, into pMCAT3 (ref. 33). The anti-BCR-ABL/ 
VP16 expression vector was generated by inserting the in-frame fusion 
between the activation domain of herpes simplex virus VP16 (ref. 24) 
and the zinc-finger peptide in the pEF-BOS vector 34 . C3H10T1/2 cells 
were transiently cotransfected with 10 ug reporter plasmid and 10 ug 
expression vector. RSVL 3S . which contains the Rous sarcoma virus long 
terminal repeat linked to luciferase, was used as an internal control to 
normalize for differences in transfection efficiency. Cells were trans- 
fected by the calcium phosphate precipitation method and CAT was 
assayed as described 36 . Plasmid pG5EC. which has five consensus 17- 
mer GAL4-binding sites upstream from the minimal promoter of the 
adenovirus Elb TATA box, and pMlVP16 vector, which encodes an in- 
frame fusion between the DNA-binding domain of GAL4 and the activa- 
tion domain of herpes simplex virus VP16, were used as a positive 
control 37 . 







FIG. 5 Immunofluorescence microscopy in the absence of IL-3 of Ba/ 
F3 + pl90 and 8a/F3 + p210 cells transiently transfected with the anti- 
BCR-ABL expression vector and stained with the 9E10 antibody. The 
images show untransfected cells (a), and transfected cells in which the 
anti-BCR-ABL peptide is localized in the nucleus (b, c and d). 24 h 
after IL-3 withdrawal, transfected Ba/F3 + pl90 cells show chromatin 
condensation and nuclear fragmentation into small apoptotic bodies 
(seen in various stages in b and in c, where the effect is more pro- 
nounced). In contrast, neither untransfected Ba/F3 + pl90 cells (a) nor 
transfected Ba/F3 + p210 cells (d) show apoptosis. In b all three types 
of Ba/F3 + pl90 cell are displayed: an untransfected cell (top right); a 
transfected but still intact cell (middle); and a transfected cell showing 
nuclear fragmentation (left). 

METHODS. The anti-BCR-ABL expression vector was generated in the 
pEF-BOS vector 34 , including an 11-amino-acid c-Myc epitope tag 
(EQKLISEEDLN) at the C-terminal end. recognizable by the 9E10 



antibody 27 and the nuclear localization signal (PKKKRKV) of the large 
T antigen of SV40 virus 26 at the N-terminal end. Three glycine residues 
were introduced downstream of the nuclear localization signal as a 
spacer to ensure exposure of the nuclear leader from the folded 
molecule Ba/F3 cells were transfected with 25 jig of the anti-BCR-ABL 
expression construct tagged with the 9E10 c-Myc epitope as 
described 38 . Protein production was analysed 48 h later by immuno- 
fluorescence-labelling in the presence of IL-3. and also 24 h after sub- 
sequent withdrawal of IL-3. Cells were fixed in 4% (w/v) 
paraformaldehyde for 15 min. washed in PBS and permeabilized in 
methanol for 2 min. After blocking in 10% fetal calf serum in PBS for 
30 min. the mouse 9E10 antibody was added. After 30-min incubation 
at room temperature, a fluorescein isothiocyanate (FITC)-conjugated 
goat anti-mouse IgG (SIGMA) was added and incubated for a further 
30 min. Fluorescent cells were visualized using a confocal scanning 
microscope (magnification. x90). 
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FIG. 6 Viability in the absence of 11-3 of transformed Ba/F3 cells 
transiently transfected with a vector expressing anti-BCR-ABL peptide. 
The Ba/F3 cell line is dependent on IL*3 for growth, but becomes 
1L-3-independent when stably transformed by p!9Q 8CR '* BL or p210 eCR " 
^ cONA (ref. 10, and I.S.-G. et a/., manuscript in preparation). A 
proportion of Ba/F3 + pl90 cells corresponding to the proportion 
transfected with the anti-BCRWVBL expression vector (Fig. 5) revert 
to IL-3 dependence, but similarly transfected Ba/F3 + p210 cells are 
unaffected. 

the amino terminus with the nuclear localization signal from the 
large T antigen of SV40 virus 26 , and at the carboxy terminus 
with an 1 1-amino-acid c-Myc epitope tag recognizable by the 
9E10 antibody". This construct was used transiently to transfect 
the interleukin-3 (IL-3)-dependent murine cell line Ba/F3 
(ref. 28), or alternatively Ba/F3 + pl90 and Ba/F3 + p210 cell 
lines previously made IL-3-independent by integrated plasmid 
constructs expressing either pi w OCR ABL or p210* c *"** / \ respec- 
tively (I.S.G. et a/., manuscript in preparation). Immunofluo- 
rescence microscopy was used to visualize ceils stained with 9E10 
antibody followed by a secondary fluorescent conjugate (Fig. 
5). The efficiency of transient transfection of ail cell lines, 
measured as the proportion of immunofluorescent cells in the 
population, was 15-20%. The peptide is efficiently localized to 
the nucleus of all such transfected cells. In the presence of IL- 
3, all cell lines are viable 48 h after transfection. On the other 
hand, when IL-3 is subsequently withdrawn from cell culture, 
over 90% of the transfected Ba/F3 + pi 90 cells become apoptotic 
within 24 h (Fig. 56, c). In contrast, all transfected Ba/F3 + p210 
cells remain intact (Fig. 5d). The anti-BCR-ABL peptide is 
therefore able to restore an IL-3 dependence in the Ba/F3 + pl90 
cells which the oncogene had abolished. 

In another set of experiments in which cell viability in tissue 
culture after IL-3 withdrawal is assayed by Tripan blue exclu- 
sion, 20% of Ba/F3 + pl90 cells are found to have reverted to 
factor dependence and die within 24 h, whereas Ba/F3 + p210 
cells are unaffected (Fig. 6). Northern blots of total cytoplasmic 
RNA from transfected Ba/F3 + pI90 cells indicated levels of 
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METHODS. Cell lines Ba/F3. Ba/F3 + pl90 and Ba/F3 + p210 were 
maintained in DM EM medium supplemented with 10% fetal bovine 
serum. In the case of the Ba/F3 cell jine, 10% WEHI-3B-conditioned 
medium was included as a source of IL-3. After the transfection with 
the anti-BCR-ABL expression vector, cells (5 x io 5 per ml) were washed 
twice in serum-free medium and cultured in DMEM with 10% fetal 
bovine serum without WEHI-3B-conditioned medium. Percentage 
viability was determined by Tripan blue exclusion. Data are' expressed 
as means of triplicate cultures. 

pl90* r * messenger RNA reduced by 15-18% relative to 
untransfected cells. These results correlate well with the propor- 
tion of cells that have become transfected and which die in the 
absence of IL-3 (Figs 5 and 6). The reduction in mRNA level 
thus indicates a transcriptional block imposed by the sequence- 
specific binding of the peptide, which would obstruct the path 
of the polymerase. 

Hence a DNA-binding protein designed to recognize a specific 
DNA sequence in vitro, is active in vivo where, directed to the 
nucleus by an appended localization signal, it can bind its target 
sequence in chromosomal DNA, causing a specific inhibition of 
transcription. The use of a blocking agent in this case to target 
intragenic sequences is reminiscent of antisense oligonucleotide- 
or ribozyme-based approaches to inhibiting. the expression of 
selected genes 29 . Like antisense oligonucleotides, zinc-finger 
DNA-binding proteins can be tailored against genes altered by 
chromosomal translocations or point mutations. Also, like oligo- 
nucleotides that can be designed to repress transcription by 
triple-helix formation in homopurine-homopyrimidine 
promoters* 0 , DNA-binding proteins can bind to various unique 
regions outside genes. But by contrast to nucleic acids, proteins 
can direct gene expression by both up- or down-regulating the 
initiation of transcription when fused to activation or 
repression domains 32 . 

By acting directly on any DNA, and by allowing fusion to a 
variety of protein effectors, tailored site-specific DNA-binding 
proteins have the potential to control expression of specific genes 
and to further the possibility of manipulating the genetic mat- 
erial itself, in medicine and research. □ 
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